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Abstract 

In 1976, Dekking showed that there exists an infinite binary word that contains 
neither squares yy with \y\ > 4 nor cubes xxx. We show that 'cube' can be replaced 
by any fractional power > 5/2. We also consider the analogous problem where '4' is 
replaced by any integer. This results in an interesting and subtle hierarchy. 

1 Introduction 

A square is a nonempty word of the form yy, as in the English word murmur. It is easy to 
see that every word of length > 4 constructed from the symbols and 1 contains a square, 
so it is impossible to avoid squares in infinite binary words. However, in 1974, Entringer, 
Jackson, and Schatz proved the surprising fact that there exists an infinite binary word 
containing no squares yy with \y\ > 3. Further, the bound 3 is best possible. 

A cube is a nonempty word of the form xxx, as in the English sort-of-word shshsh. An 
overlap is a word of the form axaxa, where a is a single letter and a; is a (possibly empty) 
word, as in the French word entente. Dekking showed that there exists an infinite binary 
word that contains neither squares yy with |?/| > 4 nor cubes xxx. Furthermore, the bound 
4 is best possible. He also proved that every overlap-free word contains arbitrarily large 
squares. 

These two results suggest the following natural question: for each length / > 1, determine 
the fractional exponent p (if it exists) such that 

(a) there is no infinite binary word simultaneously avoiding squares yy with \y\ > I and 
fractional powers x*^ with e > p; 
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(b) there is an infinite binary word simultaneously avoiding squares yy with \y\ > I and 
fractional powers with e> pi 

Here we say a word w is an e'th power (e rational) if there exist words y' G S* such 
that w = y"'y', and y' is a prefix of y with + |?/'|/|?/| = e. For example, the English word 
abracadabra is an y-power. We say a word avoids p powers if it contains no subword of 
the form with e > p. We say a word avoids p'^ powers if it contains no subword of the 
form y'^ with e > p. 

In this paper we completely resolve this question. It turns out there is a rather subtle 
hierarchy depending on I. The results are summarized in Table 1. 
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Figure 1: Summary of Results 

More precisely, we have 
Theorem 1 

(a) There are no infinite binary words that avoid all squares yy with \y\ > 2. 

(b) There are no infinite binary words that simultaneously avoid all squares yy with \y\ > 3 
and cubes xxx. 

(c) There is an infinite binary word that simultaneously avoids all squares yy with \y\ > 3 
and 3+ powers. 

(d) There is an infinite binary word that simultaneously avoids all squares yy with \y\ > 4 
and I powers. 

(e) There are no infinite binary words that simultaneously avoid all squares yy with \y\>Q 
and I powers. 

(f) There is an infinite binary word that simultaneously avoids all squares yy with \y\ > 7 
and I powers. 

(g) For all t > 1, there are no infinite binary words that simultaneously avoid all squares yy 
with \y\ > t and | powers. 

The result (a) is originally due to Entringer, Jackson, and Schatz [Hj. The result (b) is 
due to Dekking The result (g) appears in a recent paper of the author and J. Karhumaki 
We mention them for completeness. The remaining results are new. 
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2 Proofs of the negative results 



We say a word avoids {l,p) if it simultaneously avoids squares yy with \y\ > I and p powers. 

The negative results (a), (b), and (e) can be proved purely mechanically. The idea is as 
follows. Given / and p, we can create a tree T = T{l,p) of all binary words avoiding 
as follows: the root of T is labeled e. If a node is labeled w and avoids then it is 

an internal node with two children, where the left child is labeled wO and the right child is 
labeled wl. If it does not avoid then it is an external node (or "leaf"). 

It is now easy to see that no infinite word avoiding exists if and only if T{l,p) is 
finite. In this case, a breadth-first search will suffice to resolve the question. Furthermore, 
certain parameters of T{l,p) correspond to information about the finite words avoiding {l,p): 

• the number of leaves n is one more than the number of internal nodes, and so n — 1 
represents the total number of finite words avoiding {l,p); 

• if the height of the tree (i.e., the length of the longest path from the root to a leaf) is 
h, then h is the smallest integer such that there are no words of length > h avoiding 

{i,py, 

• the internal nodes at depth h — 1 gives the all words of maximal length avoiding {l,p). 

The following table lists {l,p, n, h, t, S), where 

• I = \y\, where one is trying avoiding yy; 

• p, the fractional exponent one is trying to avoid; 

• n, the number of leaves of T{l,p); 

• h, the height of the tree T{l,p). 

• t, the number of internal nodes at depth h — 1 in the tree. 

• S, the set of labels of the internal nodes at depth h — 1 that start with 0. (The other 
words can be obtained simply by interchanging and 1.) 

For completeness, we give the results for the optimal exponents for 2 < I < 7. As mentioned 
above, the case / = 2 is due to Entringer, Jackson, and Schatz jSj and the case / = 3 is due 
to Dekking 0. 
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{00101101001011001001101100101101001101100100110100101100100110110010110100110110011, 
00110010011010010110010011011001011010011011001001101001011001001101100101101001011} 
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7/3 


3548 


43 
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{001011001011010011001011001101001011001011} 



Figure 2: Proofs of the negative results (a), (b), (e) 



3 Proof of (c) 

In this section we prove that there is an infinite binary word that simultaneously avoids yy 
with ||/| > 3 and 3"*" powers. 

We introduce the following notation for alphabets: := {0, 1, . . . , A; — 1}. 

Let the morphism / : E3 — > E2 be defined as follows. 

^ 0010111010 

1 0010101110 

2 ^ 0011101010 

We will prove 

Theorem 2 If w is any squarefree word over S3, then f{w) avoids yy with \y\ > 3 and 3"*" 
powers. 

Proof. We argue by contradiction. Let w = aia2 • • • a„ be a squarefree string such that 
f{w) contains a square yy with \y\ > 3, i.e., /{w) = xyyz for some x,z E S2, y G {E^^. 
Without loss of generality, assume that w is a shortest such string, so that < |a;|, \z\ < 20. 

Case 1: \y\ < 20. In this case we can take \w\ < 5. To verify that f{w) has no squares 
yy with \y\ > 3, it therefore suffices to check each of the 30 possible words w e T,^. 

Case 2: \y\ > 20. First, we establish the following result. 

Lemma 3 (a) (inclusion property) Suppose f{ab) = tf{c)u for some letters a,b,c e E2 
and strings it e E2. Then this inclusion is trivial (that is, t — e or u — e). 

(b) (interchange property) Suppose there exist letters a,b,c and strings s,t,u,v such that 
/(a) = st, f{b) = uv, and f{c) — sv. Then either a — c orb — c. 

Proof. 
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(a) A short computation verifies there are no a, b, c for which the equality f{ab) = tf{c)u 
holds nontrivially. 

(b) This can also be verified with a short computation. If |s| > 6, then no two distinct 
letters share a prefix of length 6. If |s| < 5, then \t\ > 5, and no two distinct letters 
share a suffix of length 5. 



Once Lemma 01 is established, the rest of the argument is fairly standard. It can be found, 
for example, in but for completeness we repeat it here. 

For i = 1,2, ... ,n define Ai = f{ai). Then if f{w) = xyyz, we can write 

f{w) = A^A2 ■■■An = J^^KM ■ ■ ■ ■ ■ ■ A„_i« 



where 



A, 

An, 



X 

y 

z 



A' A" 

A'lA^ ■ 
A" 



Aj_iA'j — AjAj^i ■ ■ ■ An-iA'^ 



where > 0. See Figure El 





A'I 




4 


A'^ 




A' 


A" 


Ai 


A2 ■■■ Aj^i 


A, 


Aj+l ■ ■ ■ An-l 


A-n 


X 


y 


y 


Z 



Figure 3: The string xyyz within j[w) 

If > then Aj+i = f{aj-^^i) is a subword of A'{A2, hence a subword of A1A2 
f{aia2). Thus we can write = ^j+2^j+2 with 

A'lA2 = A"-Ajj^iA'-j^2- 

See Figure m 
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Figure 4: The case \A'i\ > \A'-\ 
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But then, by Lemma 01 (a), either \Aj \ = 0, or \A'-[\ = \A'j\, or A'j_^2 is a not a prefix of 
any f{d). All three conclusions are impossible. 

If \A"\ < \Aj\, then A2 = f{a2) is a subword of AjAj+i, hence a subword of AjAj+i = 
f{ajaj-^i). Thus we can write ^3 = ^3^43 with 



A'lA^A', = a';a,+^. 



See Figure El 
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Figure 5: The case \A'[\ < \A'-\ 

By Lemmaini(a), either \A'{\ = or \A'[\ = \Aj \ or A'^ is not a prefix of any f{d). Again, 
all three conclusions are impossible. 

Therefore \A'l\ = \A'^\. Hence A'l = A"-, A2 = Aj+i, . . ., Aj^i = A^-i, and A'j = A'^. 
Since h is injective, we have 02 = ctj+i, . . . , aj_i = a„_i. It also follows that \y\ is divisible 
by 10 and Aj = A'jAj = A'^A". But by Lemma El (b), either (1) aj = an or (2) aj = ai. 
In the first case, 02 ■ ■ ■ cLj-idj = flj+i ■ ■ ■ a„-ia„, so w contains the square (02 ■ ■ ■ aj-iajY, 
a contradiction. In the second case, ai ■ ■ ■ aj_i = ajOj+i ■ ■ ■ a^-i, so w contains the square 
(ai ■ ■ ■ aj^iY, a contradiction. 

It now follows that if w is squarefree then f{w) avoids squares yy with \y\ > 3. 

It remains to see that f{w) avoids 3"*" powers. If f{w) contained for some fractional 
exponent e > 3, then it would contain x^, so from above we have |x| < 2. Thus it suffices to 
show that f{w) avoids the words 0000, 1111, 0101010, 1010101. This can be done by a short 
computation. ■ 

Corollary 4 There is an infinite binary word avoiding squares yy with \y\ > 3 and 
powers. 

Proof. As is very well-known, there are infinite squarefree words over E3 [30]. Take any 
such word w (for example, the fixed point of the morphism 2 — > 210, 1 —* 20, — > 1), and 
apply the map /. The resulting word /(w) avoids (3,3"'"). ■ 

It may be of some interest to explain how the morphism / was discovered. We iteratively 
generated all words of length 1,2,3,... (up to some bound) that avoid (3,3"*"). We then 
guessed such words were the image of a /c-uniform morphism applied to a squarefree word 
over S3. For values of /c = 2, 3, . . ., we broke up each word into contiguous blocks of size k, 
and discarded any word for which there were more than 3 blocks. For certain values of k, 
this procedure eventually resulted in words fitting the criteria. At this point we knew a 
fc-uniform morphism cannot work, so we increased k and started over. Eventually a k was 
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found for which the number of such words appeared to increase without bound. We then 
examined the possible sets of 3 fc-blocks to see if any satisfied the requirements of Lemma 121 
This gave our candidate morphism /. 

Theorem 5 Let denote the number of binary words of length n avoiding yy with |?/| > 3 
and 3+ powers. Then = n{1.0V') and A„ = 0(1.49"). 

Proof. Grimm |4j has shown there are squarefree words over S3, where A = 1.109999. 

Since the map / is 10-uniform, it follows that An = niX^^^^^) = n{1.0r). 

For the upper bound, we reason as follows. The set of binary words of length n avoiding 
yy with \y\ > 3 and 3"*" powers is a subset of the set of binary words avoiding 0000 and 
1111. The number A'^ of words avoiding 0000 and 1111 satisfies the linear recurrence A'^ = 
A'n-i + + n > 4. It follows that A'^ = 0(a"), where a is the largest zero 

of — — X — 1, the characteristic polynomial of the recurrence. Here a < 1.84, so 
An = 0(1.84"). 

This reasoning can be extended using a symbolic algebra package such as Maple. Noonan 
and Zeilberger jH] have written a Maple package DAVID_IAN that allows one to specify a list 
L of forbidden words, and computes the generating function enumerating words avoiding 
members of L. We used this package for a list L of 62 words of length < 12: 

0000, 1111,..., 111010111010 

obtaining a characteristic polynomial of degree 67 with dominant zero = 1.4895. ■ 

4 Proof of (e) 

In this section we prove that there is an infinite binary word that simultaneously avoids yy 
with \y\ > 4 and |^ powers. 

Let (7i : Sg ^ S2 be defined as follows. 

^ 0011010010110 

1 0011010110010 

2 0011011001011 

3 ^ 0100110110010 

4 ^ 0110100101100 

5 1001101011001 

6 1001101100101 

7 1010011011001 
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Let (^2 : — > Eg be defined as follows. 

^ 03523503523453461467 

1 ^ 03523503523453467167 

2 ^ 16703523503523461467 

3 ^ 03523503523461467167 

Let (73 : E3 — > E4 be defined as follows. 

^ 010203 

1 ^ 010313 

2 ^ 021013 

Finally, define : Eg ^ Eg by (7 = (71 o (72 ° S's- Note that g is 1560- uniform. 
We will prove 

Theorem 6 If w is any squarefree word over E3, then g{w) avoids yy with \y\ > 4 and I"*" 
powers. 

Proof. The proof is very similar to the proof of Theorem |2l and we indicate only what 
must be changed. 

First, it can be checked that Lemma El also holds for the morphism g. 

As before, we break the proof up into two parts: the case where g{w) = xyyz for some 
y with 4 < |?/| < 2 ■ 1560, and the case where g{w) = xyyz for some y with \y\ > 2 ■ 1560. 
The former can be checked by examining the image of the 30 squarefree words in Eg under 
g. The latter is handled as we did in the proof of Theorem |2l We checked these conditions 
with programs written in Pascal; these are available from the author on request. ■ 

Corollary 7 There is an infinite binary word avoiding squares yy with \y\ > 4 and I"*" 
powers. 

It may be of some interest to explain how the morphisms gi, g2, (73, were discovered. 

We used a procedure analogous to that described above in Section |21 However, since 
it was not feasible to generate all words avoiding (4, 1"*") and having at most 3 contiguous 
blocks of length 1560, we increased the alphabet size and and tried various fc-blocks until we 
found a combination of alphabet size and block size for which the number of words appeared 
to increase without bound. We then obtained a number of possible candidates for blocks. 

Next, we determined the necessary avoidance properties of the blocks given by images of 
letters under gi. For example, 5^1(0) cannot be followed by (71 (1), because this results in the 
subword 000, which is a 3rd power (and 3 > 2.5). The blocks that must be avoided include 
all words with squares, and 

01, 02, 04, 05, 06, 07, 10, 12, 13, 17, 20, 21, 24, 25, 26, 27, 30, 31, 32, 36, 37, 40, 41, 42, 43, 47, 
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51, 54, 56, 57, 60, 62, 63, 64, 65, 72, 73, 74, 75, 76, 034, 145, 153, 161, 353, 450, 452, 535, 615, 616, 

714, 715, 2346703, 5234670, 5234671, 53467035, 6703523461, 2346146703503, 5234614670350 

This list was computed purely mechanically, and it is certainly possible that this list is not 
exhaustive. 

We now iterated our guessing procedure, looking for a candidate uniform morphism that 
creates squarefree words avoiding the patterns in the list above. This resulted in the 20- 
uniform morphism g2. 

We then computed the blocks that must be avoided for g2. This was done purely mechan- 
ically. Our procedure suggested that arbitrarily large blocks must be avoided, but luckily 
they (apparently) had a simple finite description: namely, we must avoid 12, 23, 32, and 
blocks of the form 2x0x1 and 3x1x0 for all nonempty words x, in addition to words with 
squares. 

We then iterated our guessing procedure one more time, looking for a candidate uniform 
morphism that avoids these patterns. This gave us the morphism g^. 

Of course, once the morphism g = gi o g2 ° 93 is discovered, we need not rely on the list 
of avoidable blocks; we can take the morphism as given and simply verify the properties of 
inclusion and interchange as in Lemma El 

Theorem 8 Let denote the number of binary words of length n avoiding yy with > 4 
and 1^ powers. Then 5„ = fi(l. 000066") and 5„ = 0(1.122"). 

Proof. The proof is analogous to that of Theorem^! We use the fact that g is 1560-uniform, 
which, combined with the result of Grimm j3] , gives the bound 1.109999^/1^*^° = 1.000066899. 

For the upper bound, we again use the Noonan-Zeilberger Maple package. We used the 
54 patterns corresponding to words of length < 20. This gave us a polynomial of degree 27 
with dominant zero = 1.12123967. ■ 

5 Proof of (f ) 

In this section we prove that there is an infinite binary word that simultaneously avoids yy 
with \y\>l and I"*" powers. 

Let /ii : ^ be defined as follows. 

-> 00110100101100 

1 00110100110010 

2 01001100101100 

3 10011011001011 

4 11010011011001 
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Let /i2 : S3 Eg be defined as follows. 

032303241403240314 

1 032314041403240314 

2 ^ 032414032303240314 

Finally, define /i : S3 — Eg by /i = /ii o /i2. Note that h is 252-uniform. 
We will prove 

Theorem 9 If w is any squarefree word over S3, then h{w) avoids yy with \y\ > 7 and 
powers. 
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Proof. Again, the proof is quite similar to that of Theorem |2l We leave it to the reader 
to verify that the inclusion and interchange properties hold for /;,, and that the image of all 
the squarefree words of length < 5 are free of squares yy with \y\ < 7 and |^ powers. ■ 

Corollary 10 There is an infinite binary word avoiding squares yy with \y\ > 7 and 
powers. 

The morphisms /ii,/i2 were discovered using the heuristic procedure mentioned in Sec- 
tional The avoiding blocks for hi were heuristically discovered to include 

01, 02, 10, 12, 13, 20, 21, 34, 42, 43, 304, 23031, 24041, 231403141, 232403241 

as well as blocks containing any squares. Then /i2 was constructed to avoid these blocks. 

Theorem 11 Let Cn denote the number of binary words of length n avoiding yy with \y\>7 
and 1^ powers. Then C„ = (](1.0004") and Cn = 0(1.162"). 

Proof. The proof is very similar to that of Theorems El and |H1 

For the lower bound, note that h is 252-uniform. This, combined with the bound of 
Grimm pi, gives a lower bound of (](A") for all A < 1.109999^/^52 ^ 1. 0004142. 

For the upper bound, we again used the Noonan-Zeilberger Maple package. We avoided 
58 words of length < 20. This resulted in a polynomial of degree 26, with dominant zero 
= 1.1615225. ■ 



6 Enumeration results 

In this section we provide a table of the first values of the sequences An, Bn, and Cn, defined 
in Sections 01 El andEl for 1 < n < 25. 
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n 


An 


Bfi 







1 


1 


1 


1 


2 


2 


2 


2 


4 


4 


4 


3 


8 


6 


6 


4 


14 


10 


10 


5 


26 


16 


14 


6 


42 


24 


20 


7 


68 


36 


30 


8 


100 


46 


38 


9 


154 


64 


50 


10 


234 


74 


64 


11 


356 


88 


86 


12 


514 


102 


108 


13 


768 


114 


136 


14 


1108 


124 


164 


15 


1632 


140 


196 


16 


2348 


160 


226 


17 


3434 


178 


264 


18 


4972 


198 


322 


19 


7222 


212 


384 


20 


10356 


230 


436 


21 


14962 


256 


496 


22 


21630 


294 


578 


23 


31210 


342 


674 


24 


44846 


366 


754 


25 


64584 


392 


850 



Figure 6: Values of A^, B^, Cn, < n < 25 
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